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1, INTRODUCTION 

Over the past 20 years, new digital technologies have developed to change our way of life radically [1-3]. 
Ever thinner smartphones, regularly smaller and more powerful processors, appliances that communicate 
with each other, social media networks that know everything about us and on which we depend, have been 
developed. The industry has made significant leaps to keep pace in an increasingly demanding market [4-6]. 
At the same time, companies frequently have to deal with limited resources and constraints imposed by 
various national and international institutions. For instance, in Europe, there is the purpose of reducing 
European industry consumption by 20% by 2020 [7]. As an outcome, the need to optimize production 
processes through investments in the technological development of the factories 1s becoming increasingly 
pressing [8]. The Internet of Things (IoT) is a network of physical objects that can be reached through 
the Internet [9, 10]. These objects are daily things that contain integrated technology able to interact with 
the external environment. IoT necessitates the development of many technologies currently designed by 
the leading companies and leads to a list of ever-growing advantages [11, 12]. lol applications cover many 
features, from health [13-15] to home automation [16-19], from transport [20-24] to security [25-29]. 
For instance, by spreading a set of sensors and processors, it 1s possible to control the windows, the temperature 
of the house, the lights, and other home appliances wirelessly [30, 31]. Another application case is represented 
by the cities, which can be developed more intelligent and efficient thanks to the interconnection of smart 
objects. For instance, thanks to the IoT, the traffic lights can be connected to a camera circuit, disseminated 
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everywhere in the city, to identify the traffic level and mass movements, then avoiding possible inconveniences 
preventively [32-34]. As it is possible to perceive, the IoT opens up a world of limitless possibilities, 
more extensive than those offered a few years ago from the digital age. The creation of the IoT would have 
been much more complicated if a big data structure had not been followed since the latter allows analyzing 
vast amounts of data [35-37]. The multifaceted nature of big data is mainly due to the unstructured design 
of most of the data produced by modern technologies, such as web registers, radio frequency identification 
(RFID), sensors embedded in devices, machinery, vehicles, Internet searches, social networks, laptops, 
smartphones, GPS devices, and call center records [38-40]. 

Several studies have revealed that the use of big data represents one of the qualifying marks of 
the metamorphosis that is taking place in production processes [41-44]. Among these studies, the most notable are 
those carried by McKinsey & Company [45], by Boston Consulting, and the Milan Polytechnic Observatory [46]. 
For instance, from their investigations, it can be perceived that the fourth industrial revolution concentrates 
on the enactment of fascinating essential technologies labeled as enablers. Furthermore, big data analysis 
represents those techniques for the management of quite large measures of data, accomplished by open 
methods that yield forecasts or prognostications. The current competitive business environment is forcing 
companies to process high-speed data and integrate valuable information into production processes. As stated 
by Forrester Research [47], through interviews with experts in the field, the use cases of big data analysis 
mainly fall into three groups: 

— Efficiency and management of operational risks; the study of big data practiced to risk minimization in 
financial processes, asset management, human resources, and the supply chain 

— Security and application performance through predictive examination and big data analysis 

— Employed in information technologies monitoring; they serve to prevent problems in affording services 
and managing events in real-time. The analysis models use the datalog produced by servers and network 
devices to evaluate performance levels, find bottlenecks, and other features 

—- Knowledge and services to customers: this category includes all the solutions and applications for big 
data analysis used for marketing and sales projects, for product development, but also for optimizing 
the digital experience 

This paper focuses on delivering a short review concerning the current technologies, future 
perspectives, and the evaluation of some use cased associated with the analysis of Big Data. Section 2 
presents a summary of several case studies that have been published in this research field, while section 3 
concludes the paper. 


2. CASE STUDIES 

The first study taken into account treats the use of big data analysis aimed at improving customer 
loyalty programs [48]. In the era of cloud computing, cyber-physical systems, the Internet of Things and big 
data analysis, every user click on search engines, social networks, and e-commerce portals produce data, 
which represent for the companies an added value [49]. For this reason, the adoption of big data analysis in 
retention strategies is worthwhile not only for the attainment of new customers but also to enhance the profits 
generated by pre-existing clients, as it allows the industries to adapt their productive strategies directly to 
the needs of consumers, thus seeking to maximize revenues. The approach introduced in [48] is based on 
the use of software as a service (SaaS) to gather data from an e-commerce platform and analyze them to 
provide the users of the portal a list of recommended products. This solution, shown in Figure 1, is made up 
of an e-commerce platform, a dashboard accessible to the company’s marketing operators, an Event Bus, 
a NoSQL database, a Streaming Manager, a Machine Learning Cluster, and an Email Marketing Microservice. 

The case study is composed of a multinational corporation. Buyers perform purchases through 
the company’s e-commerce platform, producing data on their inclinations. Subsequently, a marketing 
character delivers a pre-analysis of these data to create a pre-selection of candidates for the loyalty program 
based on different criteria, which can be the geographical location, the number of orders placed or canceled, 
and the product reviews made. Subsequently, at each occurrence of a portal event, such as a purchase or review, 
the corresponding data will be directed into the Event Bus. The latter has the dual task of transferring 
unstructured information (i.e., user profiles, geolocation data, assessments, and so on) to both a NoSQL 
database and the Streaming manager. They, in turn, sends them to the Machine Learning Cluster, on which 
stays the "Recommendation" algorithm, which has the assignment of associating each candidate client with 
a list of suggested products and is executed for each new entry. These two last components work as batch 
processes. Furthermore, the interconnection and data transfer between the various parts of the SaaS is carried 
out through a scalable asynchronous bus. 
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Figure 1. System architecture 


The second study under consideration is part of the smart energy sector and concerns the use of data 
analysis to optimize energy consumption in IoT contexts. The case study proposed by the authors is based on 
the analysis of the additive manufactory (AM) production process, which uses the selective laser sintering (SLS) 
technique [50] and which consists of high energy consumption with a high production yield [51]. 
The energy consumption of the entire production process can be represented by the sum of energy 
consumption of different machines. For each complete production process, the total energy consumption 
varies due to the different production conditions and the different work environments. Furthermore, 
the power of a distinct subsystem, which is different for each subsystem, is difficult to determine with 
precision in real-time conditions. 

In the era of Industry 4.0, among the various key factors that drive industrial development, 
the attention is focused on the environmental impact that can generate particular production processes. 
Currently, it 1s estimated that industrial production activities use around 35% of the entire global electricity 
supply and that they produce approximately 20% of total carbon emissions. Besides, over the past 20 years, 
the top five manufacturing countries have experienced a 50% increase in their greenhouse gas emissions. 
For this reason, in the industrial field, there are strong interests in trying to develop new approaches that 
allow realizing sustainable production cycles but minimizing energy consumption. The institutions 
are moving in this direction, such as Europe. The new European energy legislation requires industries to 
reduce both their gas emissions and their energy use by 20% by 2020 [52]. It is recognized that the energy 
efficiency of production processes usually is less than 30% and that, for some specific methods, energy losses 
are dramatically high [53]. Consequently, the use of highly efficient energy through data analysis, in addition 
to reducing production costs and expanding profit margins, can also have a positive impact on the associated 
environmental and social problems. This condition would allow industries to fall within the constraints 
imposed by international regulations. 
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Another case study introduces an approach whose purpose is to develop an embryonic model for 
predicting energy consumption in the manufacturing industry. In particular, the problem involves the prediction 
of the energy consumption of an "Additive Manufacturing" system, because, due to the different correlations 
between various properties associated to production, the reduction of the energy consumption of this process turns 
out to be one of the most critical points of the research in this area [54, 55]. Before the coming 
of Industry 4.0, considering the limited sensors or their apparent unrelatedness, several intangible factors, 
such as the health conditions of the equipment, the competence of the personnel, the environmental 
data, the temperature of the workspace and the noise of cut, were not considered for the analysis 
of production processes. 

Nowadays, however, in an industrial environment with large sensors capable of generating massive 
amounts of data, it is possible to identify the correlation and the causal relationship of multi-source 
information to reveal important hidden factors in man-machine interactions. For instance, in the presence 
of the right lighting conditions in the production environment, it is possible to identify some critical latent 
factors in temperature, environmental humidity, cutting noise and sound field distributions, as these can 
influence the staff working efficiency, production quality, as well as energy consumption and performance 
degradation of industrial machines. Recognizing this reasonable goal, an analysis based on large data sets, 
collected from monitoring several factors, can help to improve the performance degradation prediction 
model, to identify energy consumption trends and to provide additional knowledge through feedback to 
achieve a better decision-making system concerning preventive maintenance and optimization of energy 
consumption, as shown in Figure 2. 
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Figure 2. Decision-making scheme relating to preventive maintenance, aimed at saving energy consumption 





A final case study concerns the so-called large-scale Data Ingestion of data from various devices in 
the context of Industry 4.0 [56]. With the fourth industrial revolution, the IoT has taken on a significant role 
in the development process, but this development still finds the import of heterogeneous and multi-source 
devices on a large scale as a challenge. In the industrial sector, more and more sensors are being incorporated 
into intelligent products, production equipment, and production monitoring. Smart production, which has 
become a vital component of production in the Industry 4.0 era, is facing several challenges mainly related to 
the following features: 

— Heterogeneous data: There are different types of smart devices in a company. Each device has single or 
multiple parameter sensors, with the consequent creation of heterogeneous data. The first great challenge 
is to allow interaction among different data; 

— Multi-source data: In addition to streaming data in real-time, there are also legacy applications that 
manage existing devices in companies and that continue to use legacy software despite being dated or no 
longer supported [57]. Streaming data can be analyzed in real-time. As shown in Figure 3, after the real-time 
analysis and the original application, the device data can be divided into data streams, file data, or data in 
relational databases. In this regard, the ingestion of all these heterogeneous data is the second major 
challenge to be faced; 
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— Large amounts of data: Recently, there has been an explosive growth in the category, speed, and volume 
of data [58]. With the development of intelligent manufacturing technology, it is possible to predict that 
the IoT will increase its data flows to unprecedented levels [59]. As a consequence, there will be the need 
to develop new approaches that are increasingly effective both for the resolution of the storage and for 
the querying of records in big data contexts. 
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Figure 3. Analysis of heterogeneous data 


3. CONCLUSION 

This paper aimed to present the advantages introduced by the extensive use of big data through 
the examination of different approaches in the literature. To this end, some case studies have been explained 
in which it has been revealed how the employment of data analysis brought countless benefits in investigated 
scenarios; from energy saving to preventive maintenance, up to the timely adaptation of production through 
data analysis carried out in the marketing departments of the companies. Initially, a general study focused on 
Industry 4.0, and the analysis of big data has been evaluated. Afterward, the intentions to be accomplished in 
the considered scenarios have been reported, as well as the advantages brought by the analysis of big data in 
some case studies treated in industrial applications. In this sense, some methods introduced in the literature 
have been acquainted and investigated, also presenting the results to highlight their strengths and weaknesses. 
It has been found that in the various solutions examined, the analytical approach of big data yielded 
meaningful profits. Moreover, the adoption of these solutions embodies an added value for all corporations 
that aspire to prevail competitive on the market. 

It is well known that investments in this field are crucial to pursuing this goal. Besides, from 
the analysis carried out in this work, it is reasonable to emphasize some comprehensive evaluations: concerning 
big data analysis in marketing strategies, considering that, for instance, every click of a user browsing 
the web corresponds to a generated data stream, potentially containing his/her preference, the adoption 
of data analysis could enable companies to identify trends in consumer preferences. As a result, it will be 
possible to conduct specific marketing and customer loyalty campaigns based on this added value and to 
adapt the production lines according to the compelling needs promptly; regarding big data analysis in 
the detection of production anomalies, this procedure could yield the early detection of severe trends 
and deviations in the products, which could lead to critical drawbacks in terms of economic losses and safety 
in the use of consumer products; concerning big data analysis in the prediction of the energy consumption 
of a production plant or a technological plant in green smart cities, it is crucial to emphasize how this 
component turns out to be intelligent from the environmental point of view and of the economic revenues as 
it can allow the plant operators to predict the energy consumption of the equipment and, potentially, find 
various factors that negatively affect the energy expenditure of the industrial machinery, such as failures or 
errors; in the use of big data analysis in preventive maintenance, the main advantages are measured in terms 
of economic revenues, since its application in this area may allow for taking into account various factors 
(environmental and non-environmental) that negatively influence the operation of machinery, through the 
identification of intangible factors that more often than not escape the qualified technical personnel, 
transforming themselves, consequently, into more significant revenues and even lower energy consumption; 
regarding big data analysis in risk management, this method is fundamental in decision-making and in 
predicting potentially dangerous situations if the study of these data is carried out, for instance, by artificial 
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intelligence devices which, unlike of human personnel, would allow a safe prediction based on real-time data. 
Moreover, in the automotive industry, there are more and more discussions about autonomous or driverless 
vehicles. The possibility of being able to count on automated devices that process enormous amounts of data 
in real-time can determine immense benefits to reduce any road accidents; in the case of the management 
of data coming from heterogeneous devices, considering that within a company switched to Industry 4.0 data 
flows of different types can be generated, their transformation into a unified format can lead to more efficient 
data management; regarding big data analysis combined with the use of IoT in the monitoring of industrial plants, 
the advantages consist in the opportunity of being able to implement, even for legacy industrial machinery, a 
remote monitoring system in real-time, to observe the performance of the production plant at any time and in any 
part of the world and even receive alarms regarding the occurrence of certain events. 

Anyhow, the use of data analysis in an industrial environment can also involve disadvantages. 
Indeed, the digitization of production also implies potential risks and threats in terms of cyber-attacks 
and intrusions. Furthermore, the use of dedicated hardware supports, such as distributed computing systems 
located inside the production plants, affects both economic factors and environmental impact as it will 
require double energy expenditure due to both power supplies than cooling, weighing on energy costs. 
In conclusion, it is conceivable to assert that the advantages brought by the use of the ITC in industrial 
contexts far outweigh the disadvantages and that the companies that have not yet welcomed Industry 4.0, 
in absolute terms, appear to be little competitive on the market compared to the competition. Hence, there 
should be more considerable future involvement of industrialists as the potential benefits, as repeatedly stated 
in this work, are various. Besides, thanks to the increasingly rapid technological development, in the future, 
the negative factors could be concretely reduced to a minimum, first of all, the environmental impact deriving 
from industrial production. 
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